viruses (MDPI)


Article 

Novel Bat Alphacoronaviruses in Southern China 
Support Chinese Horseshoe Bats as an Important 
Reservoir for Potential Novel Coronaviruses 


Susanna K.P. Lau 1,2,3,4,+, Antonio C.P. Wong 1,+, Libao Zhang 5,+, Hayes K.H. Luk 1,
Jamie S.L. Kwok 6, Syed S. Ahmed 1, Jian-Piao Cai 1, Pyrear S.H. Zhao 1, Jade L.L. Teng 1,2,3,
Stephen K.W. Tsui 6, Kwok-Yung Yuen 1,2,3,4,* and Patrick C.Y. Woo 1,2,3,4,*


1 Department of Microbiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pokfulam, 
Hong Kong, China; skplau@hku.hk (S.K.P.L.); antonwcp@connect.hku.hk (A.C.P.W.); hkhluk@hku.hk
(H.K.H.L.); shakeel87@gmail.com (S.S.A.); caijuice@hku.hk (J.-P.C.); pyrear@126.com (P.S.H.Z.);
llteng@hku.hk (J.L.L.T.)

2 State Key Laboratory of Emerging Infectious Diseases, The University of Hong Kong, Pokfulam,
Hong Kong, China

3 Carol Yu Centre for Infection, The University of Hong Kong, Pokfulam, Hong Kong, China 

4 Collaborative Innovation Centre for Diagnosis and Treatment of Infectious Diseases, The University of
Hong Kong, Pokfulam, Hong Kong, China 

5 Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Guangdong Public 
Laboratory of Wild Animal Conservation and Utilization, Guangdong Institute of Applied Biological 
Resources, Guangzhou 510000, China; zhanglb@giabr.gd.cn 

6 School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Shatin,
Hong Kong, China; jamie_slk@link.cuhk.edu.hk (J.S.L.K.); kwtsui@cuhk.edu.hk (S.K.W.T.) 

* Correspondence: kyyuen@hku.hk (K.-Y.Y.); pcywoo@hku.hk (P.C.Y.W.); 

Tel.: +852-2255-4892 (K.-Y.Y. & P.C.Y.W.); Fax: +852-2855-1241 (K.-Y.Y. & P.C.Y.W.) 

+ These authors contributed equally to this manuscript. 


Received: 15 April 2019; Accepted: 6 May 2019; Published: 7 May 2019 


Abstract: While bats are increasingly recognized as a source of coronavirus epidemics, the diversity
and emergence potential of bat coronaviruses remain to be fully understood. Among 1779 bat
samples collected in China, diverse coronaviruses were detected in 32 samples from five different
bat species by RT-PCR. Two novel alphacoronaviruses, Rhinolophus sinicus bat coronavirus HKU32
(Rs-BatCoV HKU32) and Tylonycteris robustula bat coronavirus HKU33 (Tr-BatCoV HKU33), were
discovered from Chinese horseshoe bats in Hong Kong and greater bamboo bats in Guizhou
Province, respectively. Genome analyses showed that Rs-BatCoV HKU32 is closely related to
BatCoV HKU10 and related viruses from diverse bat families, whereas Tr-BatCoV HKU33 is closely
related to BtNv-AlphaCoV and similar viruses exclusively from bats of the Vespertilionidae family. The
close relatedness of Rs-BatCoV HKU32 to BatCoV HKU10, which was also detected in Pomona
roundleaf bats from the same country park, suggests that these viruses may tend to
infect genetically distant bat populations in close geographical proximity, with subsequent
genetic divergence. Moreover, the presence of a SARSr-CoV ORF7a-like protein in Rs-BatCoV HKU32
suggests a common evolutionary origin of this accessory protein with SARS-CoV, also from Chinese
horseshoe bats, an apparent reservoir for coronavirus epidemics. The emergence potential of
Rs-BatCoV HKU32 should be explored.

1. Introduction


The Severe Acute Respiratory Syndrome (SARS) and more recently the Middle East Respiratory 
Syndrome (MERS) have proven the emergence potential of animal coronaviruses (CoVs) and aroused 
immense interest in the discovery of novel CoVs in animals and humans. While SARS coronavirus 
(SARS-CoV) originated from horseshoe bats in China, its animal reservoir, and was transmitted to
humans after amplification in palm civets from wildlife markets [1,2], dromedary camels in the 
Middle East are the immediate animal source of the MERS epidemic caused by MERS coronavirus 
(MERS-CoV) [3-6]. Bats also harbor MERS-CoV-related viruses, which may suggest a possible bat 
origin [7-15], although the evolutionary origin of MERS-CoV remains to be ascertained. 

Through the discovery of numerous novel CoVs since the SARS epidemic [16-18], bats were 
uncovered as an important animal reservoir for alphacoronaviruses (alphaCoVs) and 
betacoronaviruses (betaCoVs), and birds as an important reservoir for gammacoronaviruses 
(gammaCoVs) and deltacoronaviruses (deltaCoVs) [19-23]. In particular, bats harbor CoVs that can 
evolve to cause epidemics in humans and other animals. When MERS-CoV was first discovered, it 
was most closely related to Tylonycteris bat CoV HKU4 (Ty-BatCoV HKU4) and Pipistrellus bat CoV 
HKU5 (Pi-BatCoV HKU5) that were detected five years ahead of the MERS epidemic, from bats in
Hong Kong [7-10,24,25]. This illustrates the importance of continuous surveillance studies of bat 
CoVs in preparing for future epidemics in humans. 

Besides SARS-CoV and MERS-CoV, bat CoVs closely related to other human CoVs, including 
human CoV 229E and human CoV NL63, were also recently discovered [26-28], suggesting that bats 
are an important animal source of CoVs that may emerge in humans. On the other hand, bat CoVs
may also evolve to infect other animals. For example, porcine epidemic diarrhea virus (PEDV) is 
phylogenetically closely related to Scotophilus bat coronavirus 512 (Sc-BatCoV 512), suggesting
cross-species transmission events between bats and pigs [29]. In 2016-2017, outbreaks of severe watery
diarrhea were reported in suckling piglets from farms in Guangdong Province, China, which were 
found to be caused by swine acute diarrhea syndrome coronavirus (SADS-CoV) [30-32]. SADS-CoV 
is very close to and likely to have emerged from Rhinolophus bat CoV HKU2 (Rh-BatCoV HKU2), first 
discovered in Hong Kong and detected in a wide range of horseshoe bats including Rhinolophus 
sinicus, Rhinolophus affinis and Rhinolophus ferrumequinum [30,33]. In particular, the spike protein of 
SADS-CoV shared 93-98% amino acid identity to that of Rh-BatCoV HKU2 from Rhinolophus affinis, 
supporting recent interspecies jumping from bats to pigs [30]. 

To further explore the diversity of CoVs in bats and understand the genetic evolution of CoVs, 
we collected bat samples from Hong Kong and mainland China. Diverse CoVs belonging to 
alphaCoVs and betaCoVs were detected, including two novel alphaCoVs, as confirmed by complete 
genome sequencing and characterization, supporting bats as an important reservoir for CoVs. The 
evolutionary relationship of the two novel alphaCoVs to other known CoVs is also discussed. 


2. Materials and Methods 


2.1. Ethics Statement 


Collection of bat samples in Hong Kong was approved by the Department of Agriculture, 
Fisheries and Conservation, Hong Kong Special Administrative Region (HKSAR); and the Committee 
on the Use of Live Animals in Teaching and Research, The University of Hong Kong (CULATR Ref. 
No.: 2284-10 and 3330-14; Date of approval: 23 March 2011 and 17 April 2014). Bat samples from 
mainland China were collected by the Guangdong Institute of Applied Biological Resources 
(Guangzhou, China) in accordance with guidelines of Regulations for Administration of Laboratory 
Animals under a license from Guangdong Entomological Institute Administrative Panel on 
Laboratory Animal Care. 


2.2. Collection of Bat Samples 


Viruses 2019, 1, 5042 3 of 20 


Oral and alimentary samples were collected from bats captured from HKSAR, and Guizhou and 
Guangdong Provinces, mainland China, during 2013-2015 using procedures described previously 
[1]. To prevent cross contamination, sterile disposable swabs with protective gloves were used during 
sample collection and changed between samples. All samples were immediately placed in viral 
transport medium (Earle’s balanced salt solution, 20% glucose, 4.4% NaHCO3, 5% bovine albumin,
vancomycin 50,000 µg/mL, amikacin 50,000 µg/mL, nystatin 10,000 units/mL) before transportation
to the laboratory. For prolonged storage, all samples were stored at −80 °C before further studies.


2.3. Detection of Bat CoVs by RNA Extraction, RT-PCR and DNA Sequencing 


Viral RNA was extracted from oral and alimentary samples using the QIAamp viral RNA minikit
(Qiagen, Hilden, Germany). Eluted RNA was used as the template for reverse transcription-PCR
(RT-PCR). CoVs were detected by amplifying a 440-bp fragment of the RNA-dependent RNA
polymerase (RdRp) gene using conserved primers (5’-GGTTGGGACTATCCTAAGTGTGA-3’
and 5’-ACCATCATCNGANARDATCATNA-3’) [18]. Reverse transcription was performed using
a SuperScript III kit (Invitrogen, San Diego, CA, USA). The PCR mixture (25 µL) was prepared and PCR
conditions were set as described previously in an automated thermal cycler (Applied Biosystems)
[34]. Amplified PCR products were gel-purified using the QIAquick gel extraction kit (Qiagen). Both
strands of the PCR products were sequenced with an ABI Prism 3130xl Genetic Analyzer (Applied
Biosystems, Foster City, CA, USA), using the above primers. The sequences of the PCR products
were compared with the RdRp gene sequences of other known CoVs in the GenBank sequence database.
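
The reverse primer above contains IUPAC ambiguity codes (N, R, D), so it is in practice a pool of related oligonucleotides. The following is a minimal illustrative sketch, not part of the original protocol, of how the degeneracy of such a primer can be counted and its variants enumerated. The function names `degeneracy` and `expand` are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Primer sequences as given in the text (5' to 3').
FORWARD = "GGTTGGGACTATCCTAAGTGTGA"
REVERSE = "ACCATCATCNGANARDATCATNA"


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every concrete sequence encoded by a degenerate primer."""
    pools = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*pools)]


if __name__ == "__main__":
    print(degeneracy(FORWARD))  # 1 (no ambiguity codes)
    print(degeneracy(REVERSE))  # 384 = 4 * 4 * 2 * 3 * 4
```

A high degeneracy broadens the range of CoV RdRp sequences that can be amplified, at the cost of a lower effective concentration of each individual primer variant.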


2.4. Viral Culture 


Attempts to isolate Rs-BatCoV HKU32 and Tr-BatCoV HKU33 were performed by inoculating 
samples with RT-PCR positive results to different cells. Viral replication was detected by cytopathic 
effect observation and viral detection of culture supernatant collected from passages by RT-PCR. 


2.5. Complete Genome Sequencing of Rs-BatCoV HKU32 and Tr-BatCoV HKU33 


Viral genomes of two Rs-BatCoV HKU32 (TLC26A and TLC28A) and one Tr-BatCoV HKU33 
(GZ151867) were amplified and sequenced using RNA directly extracted from their alimentary 
samples respectively as templates. Both viral RNAs were reverse transcribed to cDNA by a combined
random-priming and oligo(dT)-priming strategy. For GZ151867, the amplified cDNA sample was
barcoded and sequenced using the Ion Torrent sequencing platform. The average sequencing
throughput of these samples was 12.90 Mbp and the average read length was 150.1 bp. The
single-end reads were de novo assembled using SPAdes Genome Assembler version 3.10.0 using default
parameters [35]. Coronavirus-matching contigs were searched using BLASTN version 2.5.0+ against 
NCBI nucleotide database (nt) version downloaded on October 4th 2016 with e-value cutoff at 1 x 10% 
[36]. 

Degenerate primers were designed according to multiple alignments of the genomes of other
alphaCoVs with complete genomes available, using strategies as described previously [18,20,37].
Additional primers were designed based on the results of the first and subsequent rounds of
sequencing or the next-generation sequencing (NGS) of sample GZ151867. SMARTer 5’/3’ RACE kit (Clontech,
Mountain View, CA, USA) was used to perform rapid amplification of cDNA ends and confirm the
5’ ends of the genomes. For Rs-BatCoV HKU32, a total of 53 sets of primers (available on request) 
were used for PCR. For Tr-BatCoV HKU33, a total of 44 sets of primers (available on request) were 
used for PCR. Sequences were assembled and edited manually to produce complete sequences of the 
three viral genomes. 


2.6. Phylogenetic and Genome Analysis of Rs-BatCoV HKU32 and Tr-BatCoV HKU33 


The genomes of Rs-BatCoV HKU32 and Tr-BatCoV HKU33 were aligned and analyzed with 
other alphaCoVs with complete genome sequences available from Genbank using online sequence 
alignment server MAFFT version 7 [38]. The nucleotide sequences of the genomes and the deduced 
amino acid sequences of the open reading frames were analyzed and compared with other alphaCoVs
using ORF finder (https://www.ncbi.nlm.nih.gov/orffinder/). Maximum-likelihood phylogenetic trees
with 1000 bootstrap replicates of ORF1ab and S genes were constructed using PhyML v3.0 (The French
Institute of Bioinformatics & France Genomique, Montpellier, France) [39]. Smart Model Selection
from PhyML was used to calculate the best-fit substitution model for ML analyses.


2.7. Expression of ORF10 Accessory Gene and Determination of Leader-Body Junction Sequence 


The leader-body junction site and flanking sequences of the ORF10 subgenomic mRNA in
Rs-BatCoV HKU32 strain TLC28A were sequenced and determined using an RT-PCR method as described
previously. cDNA obtained from RT was used as the template for PCR amplification with a forward
primer (5’-GCGTCTCATCCCCTCAA-3’) located in the leader sequence and a reverse primer (5’- 
GAACCAGCGATACAATCAATG-3’) located in the body of the ORF10 subgenomic mRNA. PCR 
mixture was prepared as described previously. The mixtures were amplified for 60 cycles of 94 °C for 
1 min, 55 °C for 1 min, and 72 °C for 1 min and a final extension at 72 °C for 10 min. Amplified PCR 
products were subjected to gel purification and sequencing as described previously. 


2.8. Accession Number 


The nucleotide sequences of the three complete genomes of Rs-BatCoV HKU32 and Tr-BatCoV 
HKU33 have been deposited in the GenBank sequence databases with the accession numbers 
MK720944 to MK720946. 


3. Results 


3.1. Bat Coronaviruses Surveillance and Identification of Two Novel Alphacoronaviruses 


A total of 1779 alimentary samples from 1117 bats of 20 species were obtained from Hong Kong 
and Guangdong and Guizhou Provinces in southern China (Figure 1, Table 1). RT-PCR for a 440-bp 
fragment of RdRp gene of CoVs was positive in samples from 32 bats (2.9%) of 5 species belonging to 
4 genera. Sequence analysis showed that 11 samples contained alphaCoVs, 3 contained Sarbecovirus 
(lineage B betaCoVs) and 18 contained Merbecovirus (lineage C betaCoVs). 
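
As a quick consistency check on the figures above, the detection rates can be recomputed from the counts reported in the text and in Table 1. This is only an illustrative sketch: the dictionary layout and the helper `percent` are assumptions made for this example, and only the species with positive samples are included.

```python
# Counts as reported in the text and Table 1.
TOTAL_BATS = 1117
POSITIVE_BATS = 32

# CoV-positive samples grouped by virus type.
BY_GROUP = {"alphaCoV": 11, "Sarbecovirus": 3, "Merbecovirus": 18}

# (positive bats, bats captured) for each species with a positive result.
POSITIVE_SPECIES = {
    "Hipposideros pomona": (2, 182),
    "Myotis ricketti": (1, 93),
    "Rhinolophus sinicus": (10, 272),
    "Tylonycteris pachypus": (18, 240),
    "Tylonycteris robustula": (1, 104),
}


def percent(positive, total, digits=1):
    """Detection rate as a percentage, rounded to the given number of digits."""
    return round(100.0 * positive / total, digits)


if __name__ == "__main__":
    print("overall:", percent(POSITIVE_BATS, TOTAL_BATS))
    for species, (pos, total) in POSITIVE_SPECIES.items():
        print(species, percent(pos, total, 2))
```

The per-group counts and the per-species positives both sum to the 32 positive bats reported.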

Seven alphaCoV sequences from Rhinolophus sinicus (Chinese horseshoe bats) captured in Hong Kong
showed <84% nt identity to the corresponding sequences of BtRf-AlphaCoV/HuB2013 (GenBank
accession no. NC_028814.1) and other alphaCoVs, suggesting a potentially novel alphaCoV proposed
to be named Rhinolophus sinicus bat coronavirus HKU32 (Rs-BatCoV HKU32) (Table 1). One other
alphaCoV sequence from Tylonycteris robustula (greater bamboo bats) captured from Luodian County
in Guizhou Province showed <81% nt identity to the corresponding sequence of
BtNv-AlphaCoV/SC2013 (GenBank accession no. NC_028833.1) and other alphaCoVs, suggesting another
potentially novel alphaCoV proposed to be named Tylonycteris robustula bat coronavirus HKU33
(Tr-BatCoV HKU33) (Table 1). Attempts to isolate both Rs-BatCoV HKU32 and Tr-BatCoV HKU33 in
Vero, VeroE6, RSK (in-house development), RSL (in-house development), HeLa, Caco-2 and HT-29 
cells were unsuccessful. No cytopathic effect or viral replication was detected. 

The other positive bat samples contained known bat alphaCoVs and betaCoVs. Hi-BatCoV 
HKU10 was detected in 2 samples from Hipposideros pomona captured in Hong Kong, with 99% 
nucleotide identity to the corresponding partial RdRp sequence of Hi-BatCoV HKU10 isolate 
TLC1310A (GenBank accession no. JQ989268.1) (Table 1). Myotis daubentonii CoV was detected in 1 
sample from Myotis ricketti captured in Hong Kong, sharing 96% nucleotide identity to Coronavirus 
PREDICT CoV-37 (GenBank accession no. KX285138.1) (Table 1). For betaCoVs, 3 samples from 
Rhinolophus sinicus captured in Guangdong Province contained SARS-related BatCoVs (Sarbecovirus) 
with 99% nucleotide identity to SARS-related BatCoV HKU3-12 (GenBank accession no. GQ153547.1) 
(Table 1). Ty-BatCoV HKU4 (Merbecovirus) was detected in 18 samples of Tylonycteris pachypus 
captured in Guizhou Province, with 95-96% nucleotide identity to Ty-BatCoV HKU4-4 (GenBank 
accession no. EF065508.1) (Table 1). 

Figure 1. Map of southern China showing locations where bat coronaviruses were found. Black bat
represents the location with bats positive for Hi-BatCoV HKU10; orange bat represents the location 
with bats positive for coronavirus (CoV) PREDICT CoV-37; blue bat represents the location with bats 
positive for Rs-BatCoV HKU32; green bat represents the location with bats positive for Tr-BatCoV 
HKU33; grey bat represents the location with bats positive for severe acute respiratory syndrome 
related (SARSr) BatCoV; yellow bat represents the location with bats positive for Ty-BatCoV HKU4. 
Provinces where samples were collected are in red font. 

Table 1. Detection of CoVs in different bat species by reverse transcription (RT)-polymerase chain reaction (PCR) of the 440-bp fragment of RNA-dependent RNA polymerase (RdRp) gene.

| Scientific Name | Common Name | No. of Bats Captured | No. of Bats Positive for CoV (%) | CoV Detected | Sampling Location of Bats |
|---|---|---|---|---|---|
| Cynopterus sphinx | Greater short-nosed fruit bat | 3 | 0 | - | SWH |
| Hipposideros armiger | Great roundleaf bat | 3 | 0 | - | GZ |
| Hipposideros larvatus | Intermediate roundleaf bat | 21 | 0 | - | GZ |
| Hipposideros pomona | Pomona leaf-nosed bat | 182 | 2 (1.1) | Hi-BatCoV HKU10 | TLC13, GD |
| Hypsugo pulveratus | Chinese pipistrelle | 2 | 0 | - | LMHP |
| Miniopterus magnater | Western bent-winged bat | 1 | 0 | - | SK01 |
| Miniopterus pusillus | Small bent-wing bat | 56 | 0 | - | LMH, SWH, SK01 |
| Miniopterus schreibersii | Common bent-wing bat | 23 | 0 | - | SK01 |
| Miniopterus fuliginosus | Eastern bent-wing bat | 1 | 0 | - | LMHP |
| Myotis chinensis | Large myotis | 10 | 0 | - | SK01, GZ |
| Myotis ricketti | Rickett’s big-footed bat | 93 | 1 (1.1) | Coronavirus PREDICT CoV-37 | LMH01, SK01 |
| Nyctalus noctula | Common noctule | | 0 | - | YSO |
| Pipistrellus abramus | Japanese pipistrelle | | 0 | - | MPO, YSO, KKSH |
| Pipistrellus tenuis | Least pipistrelle | | 0 | - | KKSH, YSO, SWH, LMHP |
| Rhinolophus affinis | Intermediate horseshoe bat | 76 | 0 | - | TLC01, TLC13, SK01 |
| Rhinolophus pearsonii | Pearson’s horseshoe bat | 2 | 0 | - | GDP |
| Rhinolophus pusillus | Least horseshoe bat | 17 | 0 | - | TLC13 |
| Rhinolophus sinicus | Chinese horseshoe bat | 272 | 10 (3.7) | Rs-BatCoV HKU32 (7); SARSr BatCoV (3) | TLC01, GDP |
| Tylonycteris pachypus | Lesser bamboo bat | 240 | 18 (7.5) | Ty-BatCoV HKU4 | WKT, PFL, SWH, TLC01, GZP |
| Tylonycteris robustula | Greater bamboo bat | 104 | 1 (0.96) | Tr-BatCoV HKU33 | GZP |

GD, Guangdong Province; GDP, Guangdong Province (Conghua City); GZ, Guizhou Province; GZP, Guizhou Province (Luodian County); KKSH, Kai Kuk Shue Ha,
Luk Keng; LMH, Lin Ma Hang Lead Mine; LMHP, Lin Ma Hang Pool; MPO, Mai Po Nature Reserve; PFL, Pok Fu Lam; SK01, Sai Kung; SWH, Sheung Wo Hang, Sha
Tau Kok; TLC01, Tai Lam-Shek Kong; TLC13, Tai Lam-Shek Kong; WKT, Wu Kau Tang; YSO, Yung Shue O Stream, Sai Kung.

3.2. Genome Features of the Two Novel Alphacoronaviruses, Rs-BatCoV HKU32 and Tr-BatCoV HKU33


The complete genomes of two strains of Rs-BatCoV HKU32, TLC26A and TLC28A, and
Tr-BatCoV HKU33 strain GZ151867 were sequenced and determined to characterize their genome
features.


3.2.1. Novel alphaCoV Species: Rs-BatCoV HKU32 


The genomes of Rs-BatCoV HKU32 strains TLC26A and TLC28A were both 29,201 nucleotides
in length, with 40.3% G + C content. They shared 99% overall nucleotide identity with each other.
Rs-BatCoV HKU32 strain TLC28A was selected as the reference strain for the following genomic 
analyses. 
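
The two summary statistics quoted above, G + C content and overall nucleotide identity, are simple to compute once sequences are available. The sketch below is illustrative only: it assumes the two sequences passed to `percent_identity` have already been aligned to equal length (for example with MAFFT), and it skips gapped columns, which is one of several possible conventions and not necessarily the one used in the original analysis. The function names and toy sequences are hypothetical.

```python
def gc_content(seq):
    """Percentage of G and C among the unambiguous bases of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGTU"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


def percent_identity(aligned_a, aligned_b):
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # assumption: gapped columns are ignored
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy sequences for demonstration only, not real viral data.
    print(gc_content("ATGCATGC"))
    print(percent_identity("ACGTAC", "ACGTTC"))
```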

Similar to other alphaCoV genomes, Rs-BatCoV HKU32 consisted of 10 putative open reading
frames (ORFs) including the essential ORF1ab, S, E, M and N (Table 2). Three accessory genes were located
between the S and N genes while two accessory genes were found downstream of the N gene (Figure 2).
Rs-BatCoV HKU32 ORF3 accessory protein shared low amino acid identity (31%-51%) to the
respective accessory proteins in other alphaCoVs while Rs-BatCoV HKU32 ORF10 accessory protein
shared 29% amino acid identity to SARSr-CoV ORF7a accessory protein. A putative transcription
regulatory sequence (TRS) motif, 5’-CUAAAC-3’, was identified at the 3’ end of the leader sequence
and preceded most ORFs except the S, ORF3, ORF5a and ORF9 (Table 2). An alternative TRS motif
for the S and ORF5a genes was found to be 5’-CUAAAU-3’, while those for ORF3 and ORF9 were
5’-CUUAAC-3’ and 5’-CUGAAC-3’, respectively (Table 2). The characteristics of the putative nonstructural
proteins and predicted cleavage sites of Rs-BatCoV HKU32 are shown in Tables 3 and 4.
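
To illustrate how TRS motifs and their distance to the downstream start codon (the number in parentheses in Table 2) can be located in a sequence, a minimal sketch follows. It is not the authors' method: the motif list contains only some of the motifs named in the text, the RNA string is a toy example rather than genome sequence, and `find_trs` and `distance_to_aug` are hypothetical helper names.

```python
import re

# Some of the TRS motifs named in the text (RNA alphabet).
TRS_MOTIFS = ("CUAAAC", "CUAAAU", "CUGAAC")


def find_trs(rna, motifs=TRS_MOTIFS):
    """Return sorted (1-based position, motif) pairs for every motif occurrence."""
    hits = []
    for motif in motifs:
        # Lookahead so that overlapping occurrences are also reported.
        for match in re.finditer("(?=" + motif + ")", rna):
            hits.append((match.start() + 1, motif))
    return sorted(hits)


def distance_to_aug(rna, position, motif):
    """Bases between the end of a motif and the next AUG, or None if absent."""
    motif_end = position - 1 + len(motif)  # 0-based index just after the motif
    aug = rna.find("AUG", motif_end)
    if aug == -1:
        return None
    return aug - motif_end


if __name__ == "__main__":
    # Toy RNA: a CUAAAC motif followed by three bases and a start codon.
    toy = "GGAACUAAACGGGAUGCC"
    for pos, motif in find_trs(toy):
        print(pos, motif, distance_to_aug(toy, pos, motif))
```

This simple search does not handle the case where the motif overlaps the start codon itself, as reported for ORF5a in Table 2.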

Comparative genomic analyses showed that Rs-BatCoV HKU32 shared 58.9% overall nucleotide 
identity with BtRf-AlphaCoV and 67.2% with Hi-BatCoV HKU10. To determine whether Rs-BatCoV 
HKU32 was a novel alphaCoV species, 7 conserved replicase domains of Rs-BatCoV HKU32 were 
selected for analyses according to the CoV species demarcation criteria by the ICTV [23]. Five known 
alphaCoVs with complete genome sequences available and close phylogenetic relationship to
Rs-BatCoV HKU32 were chosen for comparison. The 7 concatenated domains of Rs-BatCoV HKU32
shared 83.2%, 83.3%, 78.6%, 70.3% and 69.5% amino acid identity with those of BtRf-AlphaCoV,
BtMs-AlphaCoV, Ro-BatCoV HKU10, PEDV and Tr-BatCoV HKU33, respectively, which were below the
threshold of 90% amino acid identity (Table 5). The results supported that Rs-BatCoV HKU32 
represents a novel CoV species in the AlphaCoV genus. 
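
The demarcation logic described above reduces to comparing each pairwise identity of the 7 concatenated domains with the 90% threshold. The following sketch applies that rule to the values quoted in the text; the function names are hypothetical, and the code only restates the comparison rather than reproducing the underlying alignment.

```python
# Species demarcation threshold (% amino acid identity in the
# 7 concatenated replicase domains), as cited in the text.
THRESHOLD = 90.0

# Identities of Rs-BatCoV HKU32 strain TLC28A to its relatives (from the text).
CONCATENATED_IDENTITY = {
    "BtRf-AlphaCoV": 83.2,
    "BtMs-AlphaCoV": 83.3,
    "Ro-BatCoV HKU10": 78.6,
    "PEDV": 70.3,
    "Tr-BatCoV HKU33": 69.5,
}


def is_novel_species(identities, threshold=THRESHOLD):
    """True if every compared virus falls below the demarcation threshold."""
    return all(value < threshold for value in identities.values())


def closest_relative(identities):
    """Name of the compared virus with the highest identity."""
    return max(identities, key=identities.get)


if __name__ == "__main__":
    print(is_novel_species(CONCATENATED_IDENTITY))
    print(closest_relative(CONCATENATED_IDENTITY))
```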



Figure 2. Genome organizations of Rs-BatCoV HKU32 strain TLC28A and Tr-BatCoV HKU33 strain 
GZ151867. Genes for ORF 1a and 1b of Rs-BatCoV HKU32 strain TLC28A and Tr-BatCoV HKU33 
strain GZ151867 are represented by red and orange boxes, respectively. Genes for spike protein (S), 
envelope protein (E), membrane protein (M) and nucleocapsid protein (N) are represented by blue 
boxes. Genes for putative accessory proteins are represented by yellow boxes. 



Table 2. Coding potential and predicted domains in different proteins of Rs-BatCoV HKU32 strain TLC28A.

| ORF | Nucleotide Positions (Start-End) | No. of Nucleotides | No. of Amino Acids | Frame(s) | Putative TRS: Nucleotide Position in Genome | Putative TRS: TRS Sequence (Distance (No. of Bases) to AUG) 1 |
|---|---|---|---|---|---|---|
| 1ab | 291-20,428 | 20,137 | 6712 | +2, +3 | 69 | AACUAAAC(216)AUG |
| nsp1 | 291-875 | 585 | 195 | +3 | | |
| nsp2 | 876-2963 | 2088 | 696 | +3 | | |
| nsp3 | 2964-7649 | 4686 | 1562 | +3 | | |
| nsp4 | 7650-9083 | 1434 | 478 | +3 | | |
| nsp5 | 9084-9989 | 906 | 302 | +3 | | |
| nsp6 | 9990-10,817 | 828 | 276 | +3 | | |
| nsp7 | 10,818-11,066 | 249 | 83 | +3 | | |
| nsp8 | 11,067-11,651 | 585 | 195 | +3 | | |
| nsp9 | 11,652-11,975 | 324 | 108 | +3 | | |
| nsp10 | 11,976-12,383 | 408 | 136 | +3 | | |
| nsp11 | | 51 | 17 | +3 | | |
| nsp12 | 12,384-15,163 | 2780 | 927 | +2 | | |
| nsp13 | 15,164-16,954 | 1791 | 597 | +2 | | |
| nsp14 | 16,955-18,508 | 1554 | 518 | +2 | | |
| nsp15 | 18,509-19,525 | 1017 | 339 | +2 | | |
| nsp16 | 19,526-20,428 | 903 | 300 | +2 | | |
| S | 20,430-24,485 | 4056 | 1351 | +2 | 20,421 | AACUAAAU(3)AUG |
| ORF3 | 24,485-25,153 | 669 | 222 | +3 | 24,279 | TCCUUAAC(199)AUG |
| ORF4 | 25,184-25,543 | 360 | 119 | +2 | | |
| ORF5a | 25,544-25,888 | 345 | 114 | +2 | 25,540 | GACUAAAUG |
| ORF5b | 25,782-26,225 | 444 | 147 | +3 | | |
| E | 26,209-26,433 | 225 | 74 | +1 | 26,140 | AACUAAAC(64)AUG |
| M | 26,440-27,126 | 687 | 228 | +1 | 26,430 | GTCUAAAC(4)AUG |
| N | 27,137-28,279 | 1143 | 380 | +2 | 27,128 | AACUAAAC(3)AUG |
| ORF9 | 28,251-28,568 | 318 | 105 | +3 | 28,187 | AGCUGAAC(58)AUG |
| ORF10 (SARS-CoV ORF7a-like protein) | 28,593-28,955 | 363 | 120 | +3 | 28,284 | AACUAAAC(303)AUG |

1 TRS sequences are shown in bold.
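
The nucleotide and amino acid counts in Table 2 follow directly from the start and end coordinates: the length is end minus start plus one, and the protein length is one third of that minus one for the stop codon. A small sketch, using a few ORF coordinates from Table 2, makes the arithmetic explicit. The helper names are hypothetical, and the rule is assumed to apply only to ORFs translated in a single frame with a terminal stop codon, so it does not cover ORF1ab with its ribosomal frameshift or the individual nsps.

```python
# (start, end) genome coordinates of selected ORFs, taken from Table 2.
ORFS = {
    "S": (20430, 24485),
    "E": (26209, 26433),
    "M": (26440, 27126),
    "N": (27137, 28279),
    "ORF10": (28593, 28955),
}


def orf_length(start, end):
    """Length in nucleotides of an ORF given inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(start, end):
    """Amino acids encoded by a single-frame ORF, excluding the stop codon."""
    nucleotides = orf_length(start, end)
    if nucleotides % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return nucleotides // 3 - 1


if __name__ == "__main__":
    for name, (start, end) in ORFS.items():
        print(name, orf_length(start, end), protein_length(start, end))
```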

Table 3. Characteristics of putative nonstructural proteins of open reading frame (ORF)1ab in Rs-BatCoV HKU32 strain TLC28A, Tr-BatCoV HKU33 strain GZ151867,
BatCoV HKU10 and BtNv-AlphaCoV/SC2013.

| nsp | Putative Function or Domain | Amino Acids: Rs-BatCoV HKU32 Strain TLC28A | Amino Acids: Ro-BatCoV HKU10 183A | Amino Acids: Tr-BatCoV HKU33 Strain GZ151867 | Amino Acids: BtNv-AlphaCoV/SC2013 |
|---|---|---|---|---|---|
| nsp1 | Unknown | M1-A195 | M1-A195 | M1-A193 | M1-A193 |
| nsp2 | Unknown | P196-G891 | K196-G888 | K194-G771 | K194-G771 |
| nsp3 | ADRP, putative PLpro domains (PL1pro, PL2pro) | G892-G2453 | S889-G2518 | G772-G2339 | G772-G2338 |
| nsp4 | Hydrophobic domain | S2454-Q2931 | S2519-Q2996 | G2340-Q2817 | G2339-Q2815 |
| nsp5 | 3CLpro | S2932-Q3233 | S2997-Q3298 | S2818-Q3119 | A2816-Q3117 |
| nsp6 | Hydrophobic domain | S3234-Q3509 | S3299-Q3574 | G3120-Q3398 | S3118-Q3395 |
| nsp7 | Unknown | S3510-Q3592 | S3575-Q3657 | S3399-Q3481 | S3396-Q3478 |
| nsp8 | Unknown | S3593-Q3787 | S3658-Q3852 | S3482-Q3676 | S3479-Q3673 |
| nsp9 | Unknown | N3788-Q3895 | N3853-Q3960 | N3677-Q3784 | N3674-Q3781 |
| nsp10 | Unknown | A3896-Q4031 | A3961-Q4097 | A3785-Q3919 | A3782-Q3916 |
| nsp11 | Unknown | S4032-4048 | A4098- | T3920-3936 | A3917-3933 |
| nsp12 | RdRp | S4032-Q4958 | A4098-Q5024 | T3920-Q4846 | A3917-Q4843 |
| nsp13 | Hel | A4959-Q5555 | S5025-Q5621 | S4847-Q5443 | S4844-Q5440 |
| nsp14 | ExoN, N7-MTase | S5556-Q6073 | A5622-Q6139 | A5444-Q5960 | S5441-Q5958 |
| nsp15 | NendoU | G6074-Q6412 | S6140-Q6478 | S5961-Q6299 | G5959-Q6297 |
| nsp16 | O-MT | A6413-6712 | S6479-R6780 | S6300-Y6591 | S6298-Y6589 |

Table 4. Cleavage site used between nsp in Rs-BatCoV HKU32 strain TLC28A, Tr-BatCoV HKU33 strain GZ151867, BatCoV HKU10 and BtNv-AlphaCoV/SC2013.

| nsp Cleavage Site | Rs-BatCoV HKU32 Strain TLC28A | Ro-BatCoV HKU10 183A | Tr-BatCoV HKU33 Strain GZ151867 | BtNv-AlphaCoV/SC2013 |
|---|---|---|---|---|
| nsp1/nsp2 | A/P | A/K | A/K | A/K |
| nsp2/nsp3 | G/G | G/S | G/G | G/G |
| nsp3/nsp4 | G/S | G/S | G/G | G/G |
| nsp4/nsp5 | Q/S | Q/S | Q/S | Q/A |
| nsp5/nsp6 | Q/S | Q/S | Q/G | Q/S |
| nsp6/nsp7 | Q/S | Q/S | Q/S | Q/S |
| nsp7/nsp8 | Q/S | Q/S | Q/S | Q/S |
| nsp8/nsp9 | Q/N | Q/N | Q/N | Q/N |
| nsp9/nsp10 | Q/A | Q/A | Q/A | Q/A |
| nsp10/nsp12 | Q/S | Q/A | Q/T | Q/A |
| nsp12/nsp13 | Q/A | Q/S | Q/S | Q/S |
| nsp13/nsp14 | Q/S | Q/A | Q/A | Q/S |
| nsp14/nsp15 | Q/G | Q/S | Q/S | Q/G |
| nsp15/nsp16 | Q/A | Q/S | Q/S | Q/S |

Table 5. Pairwise comparison of Coronaviridae-wide conserved domains in replicase polyprotein 1ab and overall replicase polyprotein 1ab between Rs-BatCoV HKU32
strain TLC28A and other alphaCoVs. Values are pairwise sequence identity with Rs-BatCoV HKU32 strain TLC28A (%).

| Replicase Polyprotein Domain | BtRf-AlphaCoV/HuB2013 | BtMs-AlphaCoV/GS2013 | Ro-BatCoV HKU10 | PEDV | Tr-BatCoV HKU33 strain GZ151867 |
|---|---|---|---|---|---|
| nsp3 | 67.6 | 67.6 | 60.3 | 50.1 | 49.0 |
| nsp5 | 84.8 | 84.8 | 81.5 | 74.2 | 75.2 |
| nsp12 | 92.6 | 92.6 | 90.1 | 83.2 | 83.7 |
| nsp13 | 94.1 | 94.3 | 92.1 | 85.6 | 80.9 |
| nsp14 | 93.4 | 93.4 | 90.0 | 79.9 | 78.8 |
| nsp15 | 89.4 | 89.4 | 83.8 | 76.7 | 78.2 |
| nsp16 | 89.7 | 90.0 | 85.8 | 82.8 | 81.2 |
| 7 Concatenated Domains | 83.2 | 83.3 | 78.6 | 70.3 | 69.5 |
| Overall replicase pp1ab | 80.1 | 80.5 | 75.0 | 65.8 | 63.3 |

3.2.2. Novel alphaCoV Species: Tr-BatCoV HKU33


Tr-BatCoV HKU33 strain GZ151867 possessed a smaller genome than Rs-BatCoV HKU32, at 27,636
nucleotides with 37% G + C content. It possessed 7 putative ORFs with only 2
accessory genes, ORF3 and ORF7 (Figure 2). The putative TRS motif of Tr-BatCoV HKU33 was
5’-CUAAAC-3’ and preceded ORF1ab, M and N. An alternative TRS motif for the S and ORF7 genes was
found to be 5’-CUAAAU-3’ while the alternative TRS motif for ORF3 and E was 5’-CUCAAC-3’
(Table 6). The characteristics of the putative nonstructural proteins and predicted cleavage sites
of Tr-BatCoV HKU33 are shown in Tables 3 and 4.

Comparative genomic analyses showed that Tr-BatCoV HKU33 had the highest genome 
similarity with BtNv-AlphaCoV, sharing 69.4% overall nucleotide identity. Pairwise comparison of 
the 7 conserved replicase domains of Tr-BatCoV HKU33 indicated that Tr-BatCoV HKU33 shared 
76.3% and 74.4% amino acid identity with its closest relatives BtNv-AlphaCoV and AlphaCoV 
BatCoV/P.kuhlii/Italy206679-3/2010, respectively, suggesting Tr-BatCoV HKU33 represents another 
novel CoV species in the AlphaCoV genus (Table 7). 


Table 6. Coding potential and predicted domains in different proteins of Tr-BatCoV HKU33 strain GZ151867.

| ORF | Nucleotide Positions (Start-End) | No. of Nucleotides | No. of Amino Acids | Frame(s) | Putative TRS: Nucleotide Position in Genome | Putative TRS: TRS Sequence (Distance (No. of Bases) to AUG) 1 |
|---|---|---|---|---|---|---|
| 1ab | 278-20,052 | 19,774 | 6591 | +1, +2 | 54 | AACUAAAC(218)AUG |
| nsp1 | 278-856 | 579 | 193 | +2 | | |
| nsp2 | 857-2590 | 1734 | 578 | +2 | | |
| nsp3 | 2591-7294 | 4704 | 1568 | +2 | | |
| nsp4 | 7295-8728 | 1434 | 478 | +2 | | |
| nsp5 | 8729-9634 | 906 | 302 | +2 | | |
| nsp6 | 9635-10,471 | 837 | 279 | +2 | | |
| nsp7 | 10,472-10,720 | 249 | 83 | +2 | | |
| nsp8 | 10,721-11,305 | 585 | 195 | +2 | | |
| nsp9 | 11,306-11,629 | 324 | 108 | +2 | | |
| nsp10 | 11,630-12,034 | 405 | 135 | +2 | | |
| nsp11 | | 51 | 17 | +2 | | |
| nsp12 | 12,035-14,814 | 2780 | 927 | +1 | | |
| nsp13 | 14,815-16,605 | 1791 | 597 | +1 | | |
| nsp14 | 16,606-18,156 | 1551 | 517 | +1 | | |
| nsp15 | 18,157-19,173 | 1017 | 339 | +1 | | |
| nsp16 | 19,174-20,052 | 879 | 292 | +1 | | |
| S | 20,053-24,150 | 4098 | 1365 | +1 | 20,049 | GACUAAAUG |
| ORF3 | 24,150-24,755 | 606 | 201 | +3 | 23,876 | ATCUCAAC(268)AUG |
| E | 24,777-25,004 | 228 | 75 | +3 | 24,763 | TTCUCAAC(8)AUG |
| M | 25,011-25,697 | 687 | 228 | +3 | 25,001 | GTCUAAAC(4)AUG |
| N | 25,706-26,977 | 1272 | 423 | +2 | 25,699 | AACUAAAC(1)AUG |
| ORF7 | 26,989-27,348 | 360 | 119 | +1 | 26,982 | AACUAAAU(1)AUG |

1 TRS sequences are shown in bold.

Table 7. Pairwise comparison of Coronaviridae-wide conserved domains in replicase polyprotein 1ab and overall replicase polyprotein 1ab between Tr-BatCoV HKU33
strain GZ151867 and other alphaCoVs. Values are pairwise amino acid sequence identity with Tr-BatCoV HKU33 strain GZ151867 (%).

| Replicase Polyprotein Domain | BtNv-AlphaCoV/SC2013 | BtRf-AlphaCoV/HuB2013 | BtMs-AlphaCoV/GS2013 | BatCoV/P.kuhlii/Italy/206679-3/2010 | Rs-BatCoV HKU32 Strain TLC28A |
|---|---|---|---|---|---|
| nsp3 | 58.8 | 48.7 | 48.8 | 57.3 | 49.0 |
| nsp5 | 82.5 | 74.2 | 74.5 | 78.8 | 75.2 |
| nsp12 | 86.3 | 83.5 | 83.6 | 86.4 | 83.7 |
| nsp13 | 87.0 | 79.7 | 80.1 | 82.7 | 80.9 |
| nsp14 | 84.0 | 79.3 | 79.3 | 82.4 | 78.8 |
| nsp15 | 85.8 | 79.6 | 79.0 | 84.7 | 78.2 |
| nsp16 | 87.3 | 80.8 | 80.5 | 84.2 | 81.2 |
| 7 Concatenated Domains | 76.3 | 69.2 | 69.1 | 74.4 | 69.4 |
| Overall replicase pp1ab | 73.4 | 62.9 | 63.0 | 71.6 | 63.3 |

3.3. Phylogenetic Analyses


Phylogenetic trees were constructed using the amino acid sequences of ORF1ab and S proteins
of Rs-BatCoV HKU32, Tr-BatCoV HKU33 and other alphaCoVs, as shown in Figures 3 and 4. For
ORF1ab, Rs-BatCoV HKU32 formed a cluster with BtRf-AlphaCoV, BtMs-AlphaCoV, Ro-BatCoV
HKU10 and Hi-BatCoV HKU10, being more closely related to BtRf-AlphaCoV and BtMs-AlphaCoV
than to Ro-BatCoV HKU10 and Hi-BatCoV HKU10. Tr-BatCoV HKU33 was most closely related to
and formed another cluster with BtNv-AlphaCoV, AlphaCoV BatCoV/P.kuhlii/Italy/206645-41/2011
and AlphaCoV BatCoV/P.kuhlii/Italy/206679-3/2010, but as an outlier branch at the root of this cluster
(Figure 3).

For the S gene, Rs-BatCoV HKU32 and Tr-BatCoV HKU33 formed two similar clusters with
related alphaCoVs but showed a different phylogenetic positioning compared to that in ORF1ab
(Figure 4). Phylogenetically, Rs-BatCoV HKU32 was more closely related to Hi-BatCoV HKU10 and
Ro-BatCoV HKU10 than to BtRf-AlphaCoV and BtMs-AlphaCoV. On the other hand, Tr-BatCoV
HKU33 formed an inner branch within the cluster with BtNv-AlphaCoV, AlphaCoV
BatCoV/P.kuhlii/Italy/206645-41/2011 and AlphaCoV BatCoV/P.kuhlii/Italy/206679-3/2010.

It is interesting to note that the viruses in the alphaCoV cluster formed by Rs-BatCoV HKU32, BatCoV HKU10,
BtRf-AlphaCoV and BtMs-AlphaCoV were detected from diverse bat hosts from different bat families
including Pteropodidae, Hipposideridae, Rhinolophidae and Miniopteridae. In contrast, the viruses in the other cluster
formed by Tr-BatCoV HKU33, BtNv-AlphaCoV, AlphaCoV BatCoV/P.kuhlii/Italy/206645-41/2011
and AlphaCoV BatCoV/P.kuhlii/Italy/206679-3/2010 were detected from bats belonging to a single bat
family, Vespertilionidae (Figure 3).



Figure 3. Phylogenetic analysis of ORF1ab amino acid sequences of Rs-BatCoV HKU32 strains
TLC26A and TLC28A, Tr-BatCoV HKU33 strain GZ151867 and other alphaCoVs. The ORF1ab tree was
constructed by the maximum likelihood method using the LG + G + I + F substitution model. The bootstrap
values are calculated from 1000 trees. The tree was rooted using the corresponding sequence of Middle East
respiratory syndrome (MERS)-CoV (GenBank accession number YP_009047202.1). All bootstrap
values are shown. The scale bar represents 5 substitutions per site. Rs-BatCoV HKU32 and
Tr-BatCoV HKU33 are labeled with red (triangle) and blue (circle), respectively. Corresponding viral bat
hosts’ families are highlighted in different colors: Red, Vespertilionidae; blue, Miniopteridae; yellow,
Rhinolophidae; purple, Hipposideridae; green, Pteropodidae.

Figure 4. Phylogenetic analysis of S amino acid sequences of Rs-BatCoV HKU32 strains TLC26A and 
28A, Tr-BatCoV HKU33 strain GZ151867 and other alphaCoVs. The S tree was constructed by the maximum 
likelihood method using the WAG + G + I + F substitution model. The bootstrap values are calculated 
from 1000 trees. The tree was rooted using the corresponding sequence of MERS-CoV (GenBank accession 
number YP_009047204.1). All bootstrap values are shown. The scale bar represents 2 substitutions per 
site. Rs-BatCoV HKU32 and Tr-BatCoV HKU33 are highlighted with a red triangle and a blue circle, 
respectively. Corresponding viral hosts are shown on the right. 


3.4. Homologous SARSr-CoV ORF7a-Like Accessory Protein in Rs-BatCoV HKU32 


A homologous SARSr-CoV ORF7a-like accessory protein, ORF10, was found in Rs-BatCoV 
HKU32, located at nucleotide positions 28593 to 28955 and encoding 120 amino acids. The protein 
encoded by the ORF7a accessory gene (also known as X4) of SARSr-CoVs from both human and animal 
sources is a type I transmembrane protein [40]. InterProScan analysis showed that the Rs-BatCoV HKU32 
ORF10 protein also possessed four domains: an N-terminal signal peptide, followed by a luminal domain, 
a transmembrane segment and a cytoplasmic tail. The Rs-BatCoV HKU32 ORF10 protein shared 29% 
amino acid identity with the SARSr-CoV ORF7a protein. 
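
The stated coordinates and protein length can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the coordinates are 1-based and inclusive and that the span includes the stop codon; the function name `orf_protein_length` is hypothetical and is not part of the study's methods.

```python
def orf_protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by an ORF.

    Assumes 1-based, inclusive nucleotide coordinates (an assumption,
    not something stated explicitly in the text).
    """
    n_nt = end - start + 1                 # inclusive span length in nucleotides
    if n_nt % 3 != 0:
        raise ValueError(f"span of {n_nt} nt is not a whole number of codons")
    codons = n_nt // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop else codons


# ORF10 of Rs-BatCoV HKU32: nucleotide positions 28593 to 28955
print(orf_protein_length(28593, 28955))
```

Under these assumptions the span is 363 nucleotides, or 121 codons including the stop codon, which is consistent with the 120 amino acids reported.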

While the function of the SARSr-CoV ORF7a protein remains to be elucidated, studies have shown 
that this accessory protein is expressed during SARS-CoV replication. The ORF7a protein was shown 
to exit the endoplasmic reticulum through the COPII transport machinery and to be targeted to the Golgi 
apparatus [41]. To determine whether the ORF10 accessory gene is expressed in Rs-BatCoV HKU32, the 
leader-body junction sites and flanking sequences of the ORF10 subgenomic mRNA were determined 
(Figure 5). The subgenomic mRNA was successfully amplified and sequenced directly from both samples, 
TLC26A and TLC28A. The sequences were aligned to the leader sequence, which confirmed 
5’-CUAAAC-3’ as the core sequence of the TRS motif. The leader TRS and the body TRS of the ORF10 
subgenomic mRNA matched each other exactly. 

Leader       GCGUCUCAUCCCCUCAACUAAACGAAAUUUUUCUCUCCGUC 
ORF10 mRNA   GCGUCUCAUCCCCUCAACUAAAC(303)AUGCGGAUUUUAAUC 
Genome      CCUUCGAAGCGUGAUCAACUAAAC(303)AUGCGGAUUUUAAUC 


Figure 5. Rs-BatCoV HKU32 strain TLC28A mRNA leader-body junction and flanking sequences. The 
subgenomic ORF10 mRNA sequence is shown in alignment with the leader and genomic sequences. 
Identical nucleotides between the leader sequence and the subgenomic mRNA sequence are labeled in 
green. Identical nucleotides between the genome and the subgenomic mRNA sequence are labeled in blue. 
The putative TRS is labeled in bold type, in purple, and is underlined. The start codon AUG is labeled in 
red. 
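
The junction logic in Figure 5 can be illustrated with a short check on the three sequences as printed. This is a minimal sketch rather than the authors' analysis pipeline: the helper names `split_at_gap` and `junction_report` are hypothetical, and "(303)" is assumed to denote an elided stretch of sequence, which is skipped rather than reconstructed.

```python
TRS_CORE = "CUAAAC"  # core TRS sequence reported in the text

# Sequences copied from Figure 5.
LEADER = "GCGUCUCAUCCCCUCAACUAAACGAAAUUUUUCUCUCCGUC"
SG_MRNA = "GCGUCUCAUCCCCUCAACUAAAC(303)AUGCGGAUUUUAAUC"
GENOME = "CCUUCGAAGCGUGAUCAACUAAAC(303)AUGCGGAUUUUAAUC"


def split_at_gap(seq: str) -> tuple[str, str]:
    """Split 'UPSTREAM(n)DOWNSTREAM' into (UPSTREAM, DOWNSTREAM)."""
    upstream, _, rest = seq.partition("(")
    _, _, downstream = rest.partition(")")
    return upstream, downstream


def junction_report(leader: str, sg_mrna: str, genome: str,
                    core: str = TRS_CORE) -> dict[str, bool]:
    """Check the pattern expected of a leader-body fusion at the TRS."""
    m_up, m_down = split_at_gap(sg_mrna)
    g_up, g_down = split_at_gap(genome)
    core_end = leader.index(core) + len(core)  # end of the TRS in the leader
    return {
        # The core TRS is present in the leader, the mRNA and the genome.
        "core_in_all": all(core in s for s in (leader, m_up, g_up)),
        # Up to and including the TRS, the mRNA is identical to the leader.
        "mrna_5prime_matches_leader": m_up == leader[:core_end],
        # Downstream of the TRS, the mRNA is identical to the genome body.
        "mrna_3prime_matches_genome": m_down == g_down,
        # Upstream of the TRS, the mRNA differs from the genome body.
        "mrna_upstream_differs_from_genome": m_up != g_up,
    }


report = junction_report(LEADER, SG_MRNA, GENOME)
for name, ok in report.items():
    print(f"{name}: {ok}")
```

All four checks hold for the printed sequences, which is the pattern expected when the leader is joined to the ORF10 body sequence at the TRS.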


4. Discussion 


In this study, two novel alphaCoVs, Rs-BatCoV HKU32 from Rhinolophus sinicus in Hong Kong 
and Tr-BatCoV HKU33 from Tylonycteris robustula in Guizhou Province, were discovered. Their 
classification as novel species within the genus AlphaCoV is supported by pairwise comparison 
of their 7 concatenated domains to those of known alphaCoVs, based on the CoV species 
demarcation criteria of the ICTV. Phylogenetically, Rs-BatCoV HKU32 is most closely related to 
BtRf-AlphaCoV, BtMs-AlphaCoV, Ro-BatCoV HKU10 and Hi-BatCoV HKU10. On the other hand, 
Tr-BatCoV HKU33 is most closely related to BtNv-AlphaCoV, AlphaCoV 
BatCoV/P.kuhlii/Italy/206645-41/2011 and AlphaCoV BatCoV/P.kuhlii/Italy/206679-3/2010. Nevertheless, 
the phylogenetic positioning relative to closely related alphaCoVs differed between ORF1ab and the S gene, 
suggesting that the S gene may have followed an evolutionary path different from that of other genome 
regions. The S protein of CoVs is responsible for receptor recognition and contains epitopes for neutralizing 
antibodies, and hence is often subject to selective pressure, which may explain this difference. 
For example, we have previously found that another bat 
alphaCoV, Rs-BatCoV HKU2, contains an evolutionarily distinct spike protein which is only distantly 
related to those of other alphaCoVs [33]. 

The close phylogenetic relationship between Rs-BatCoV HKU32 and BatCoV HKU10 may be 
explained by the geographical proximity of their hosts. Besides Rs-BatCoV HKU32 and Tr-BatCoV 
HKU33, diverse alphaCoVs and betaCoVs were also detected in our bat samples. Hi-BatCoV HKU10 
and Coronavirus PREDICT CoV-37, belonging to alphaCoVs, were detected in bats in Hong Kong, 
while SARSr-BatCoVs and Ty-BatCoV HKU4, belonging to lineage B (Sarbecovirus) and C 
(Merbecovirus) betaCoVs, respectively, were detected in bats in Guangdong and Guizhou provinces. 
It is of note that Rs-BatCoV HKU32 and BatCoV HKU10 were detected in bats of different families 
that were captured at the same sampling location in a country park in Hong Kong. This suggests that 
these viruses may have evolved among geographically close but phylogenetically distant bat 
populations. We have also previously described interspecies transmission of BatCoV HKU10 
between Rousettus leschenaultii and Hipposideros pomona bats, which belong to two different bat 
families [42]. The genetic cluster formed by Rs-BatCoV HKU32, BatCoV HKU10, 
BtRf-AlphaCoV and BtMs-AlphaCoV was detected in diverse bat families, suggesting that these 
alphaCoVs may have a tendency for cross-species transmission. This is in contrast to Tr-BatCoV 
HKU33 and related viruses, which were limited to the family Vespertilionidae. Greater bamboo bats, 
the host of Tr-BatCoV HKU33, and other vesper bats are geographically widespread and able to 
occupy distinct and diverse habitats [43]. Interestingly, MERS-CoV-related bat lineage C betaCoVs 
(Merbecovirus) have also been exclusively detected in the family Vespertilionidae [10,11,13-15,44,45]. 
Further studies are needed to understand the mechanism of host tropism, especially the stringency 
of host receptor usage by different alphaCoVs, the interspecies transmissibility of coronaviruses 
among different bats and the contribution of bat ecology to this phenomenon. 

The presence of a homologous SARSr-CoV ORF7a-like protein in Rs-BatCoV HKU32 suggests a 
common evolutionary origin of this accessory protein from viruses in Chinese horseshoe bats 
(Rhinolophus sinicus). Although both viruses share Chinese horseshoe bats as a host, 
Rs-BatCoV HKU32 is an alphaCoV whereas SARSr-CoV is a lineage B betaCoV (Sarbecovirus). Although the 
two proteins in SARSr-CoVs and Rs-BatCoV HKU32 share only 29% amino acid identity, they both 
possess the typical conserved domains of a type I transmembrane protein. Moreover, this ORF10 
(SARSr-CoV ORF7a-like) gene was shown to be expressed in Rs-BatCoV HKU32, suggesting that it may play 
a role in viral replication. Besides SARSr-CoVs and Rs-BatCoV HKU32, another alphaCoV, Rs-BatCoV 
HKU2, has also been detected in this horseshoe bat species, which is the origin of the SARS epidemic. 
Interestingly, Rs-BatCoV HKU2 was also recently found to have evolved and emerged in swine 
populations, causing epidemics in China, supporting Chinese horseshoe bats as an important animal 
source for CoV epidemics in both humans and animals [30-32]. This horseshoe bat species, which 
ranges from northern India to southern China, often resides in caves or man-made cave-like 
structures such as abandoned tunnels in Hong Kong [43]. It is therefore important to preserve their 
natural habitats and avoid contact with these bats. Besides SARS-CoV and MERS-CoV, HCoV 229E, 
an alphaCoV, is also likely to have originated from bats approximately 219 to 333 years ago (Figure 4) [28], 
suggesting that alphaCoVs, like betaCoVs, are also a potential source of new epidemics in humans. 
Further studies are required to elucidate the function of the SARSr-CoV ORF7a protein and the 
emergence potential of Rs-BatCoV HKU32. 

Abbreviations 

AlphaCoV     Alphacoronavirus 
BetaCoV      Betacoronavirus 
bp           Base-pair 
CoVs         Coronaviruses 
DeltaCoV     Deltacoronavirus 
E            Envelope 
GammaCoV     Gammacoronavirus 
HCoV         Human coronavirus 
Hi           Hipposideros 
ICTV         International Committee on Taxonomy of Viruses 
M            Membrane 
MERS-CoV     Middle East Respiratory Syndrome coronavirus 
N            Nucleocapsid 
NCBI         National Center for Biotechnology Information 
ORF          Open reading frame 
PCR          Polymerase chain reaction 
Pi           Pipistrellus 
Pp           Polyprotein 
RdRp         RNA-dependent RNA polymerase 
Ro           Rousettus 
RSK          Rhinolophus sinicus kidney 
RSL          Rhinolophus sinicus lung 
RT           Reverse transcription 
S            Spike 
SADS-CoV     Swine Acute Diarrhea Syndrome coronavirus 
SARSr-CoV    Severe Acute Respiratory Syndrome related coronavirus 
TRS          Transcription regulatory sequence 
Ty           Tylonycteris 
µL           Microliter 